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Abstract 



This paper is concerned with samphng from the uniform distribution on H- 
colourings of the n-vertex path using systematic scan Markov chains. An H- 
colouring of the n-vertex path is a homomorphism from the n-vertex path to some 
fixed graph H. We show that systematic scan for //-colourings of the n-vertex path 
mixes in O(logn) scans for any fixed H. This is a significant improvement over the 
previous bound on the mixing time which was O(n^) scans. Furthermore we show 
that for a shghtly more restricted family of H (where any two vertices are connected 
■ by a 2-edge path) systematic scan also mixes in O(logn) scans for any scan order. 

On ■ Finally, for completeness, we show that a random update Markov chain mixes in 

0(n log n) updates for any fixed H, improving the previous bound on the mixing 
time from O(n^) updates. 



O 

O 1 Introduction 

_> : 

^ I Many combinatorial problems are of interest to computer scientists both in their own 
^ I right and due to their natural applications to statistical physics. Such problems can 
often be studied by considering homomorphisms from the graph of interest G to some 
fixed graph H. This is known as an if-colouring of G. The vertices of H correspond 
to colours and the edges of H specify which colours are allowed to be adjacent in an 
if-colouring of a graph. Let H = (C, E) by any fixed graph. We will refer to C as the 
set of colours (in the literature it is often referred to as the set of spins). Formally an 
iJ-colouring of a graph G = {V, Eg) is a function h -.V C such that {h{v), h{u)) G E 
for all edges (f , u) G Eq of G. For example if H is the graph in Figure [H then G = {a, b} 
and sites (in order to be consistent with existing literature, e.g. Weitz ^8], we will refer 
to elements of V as sites throughout this paper) assigned colour a in an i^-colouring of 
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Figure 1: The graph describing the independent sets model. Sites assigned colour a are 
"out" and sites assigned b are "in". 

a b 

Figure 2: The graph describing the Beach model. 
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G are permitted to be adjacent to sites assigned both a and 6, but sites assigned colour 
b can only be adjacent to sites assigned colour a. 

Due to the applicability of if-colourings to models in statistical physics, and for ease 
of analysis, if-colouring problems are often studied by restricting attention to a specific 
graph H. We now give a few examples of special cases of H that correspond to important 
if-colouring problems, if-colourings using the graph H from Figure [T] correspond to 
independent set configurations of a graph where sites assigned colour a are "out" and 
sites assigned b are "in" the independent set. It is usual to assign weight 1 to vertex a 
and some positive weight A > to vertex b in H. Independent sets (also known as the 
hard-core lattice gas model when using the weighted setting) is one of the most commonly 
studied type of if-colourings in theoretical computer science. Another well-studied case 
is when H is the g-clique, in which case i7-colourings correspond to proper g-colourings of 
the underlying graph. A proper g-colouring is a configuration where no two adjacent sites 
are permitted to be assigned the same colour. It is worth noting that proper g-colourings 
correspond to the g-state anti-ferromagnetic Potts model at zero temperature which is a 
well-studied model in statistical physics. Other well-known examples include the Beach 
model introduced by Burton and Steif [6] and the g-particle Widom-Rowlinson due to 
Widom and Rowlinson [30]. The graph corresponding to the Beach model is shown 
in Figure O The Beach model was originally introduced as an example of a physical 
system, with underlying graph Z"^, which exhibits more than a single measure of maximal 
entropy when d > 1. The g-particle Widom-Rowlinson model is a model of gas consisting 
of g types of particles that are not allowed to be adjacent to each other. The graph 
corresponding to the g = 4 case is shown in Figure [3] where the center vertex represents 
empty sites and each remaining vertex represents a particle. 

The problem of determining whether a graph has an if-colouring for a specific H has 
been well-studied and Hell and Nesetfil [20] gave a complete characterisation of graphs 
H for which this problem is NP-complete. In particular, they showed that if H has a 
loop or is bipartite then the problem is in P, and that the problem is NP-complete for 
any other fixed H. A complete dichotomy is also known for the problem of counting 
the number of if-colourings. This is due to Dyer and Greenhill p2] who showed that if 
H has at least one so-called nontrivial component then the counting problem is is #P- 
complete. Otherwise it is in P. A trivial component is a connected component which 
is either a complete graph with all loops present, or a complete bipartite graph with 
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Figure 3: The graph describing the 4-particle Widom-Rowlinson model. 




no loops present. They furthermore showed that the same dichotomy holds even when 
the underlying graph is of bounded degree, which is significant since in many physical 
applications the underlying graph tends to be of low degree. Interestingly the above 
characterisation for the decision problem does not hold for bounded degree graphs as 
shown by Galluccio, Hell and Nesetfil [17]. Despite the hardness of exactly counting 
the number of if-colourings it remains possible to approximately count the number of H- 
colourings of a graph as long as it is possible to sample efficiently from the (near) uniform 
distribution of if-colourings. This is due to a general counting-to-sampling reduction of 
Dyer, Goldberg and Jerrum [l2] that holds for any fixed H and any underlying graph. 

Sampling from the uniform distribution of if-colourings of a graph, which for this 
discussion we will denote by vr, is a challenging task and some results about the com- 
plexity thereof are known. Goldberg, Kelk and Paterson [TH| have shown that, if H 
has no nontrivial components, then the sampling problem is intractable in a complexity- 
theoretic sense. That is, they prove that there is unlikely to be any Polynomial Almost 
Uniform Sampler for if-colourings by reducing the problem of sampling from the (near) 
uniform distribution of if-colourings to the problem of counting independent sets in bi- 
partite graphs, which in turn is complete for a logically-defined subclass of #P (see Dyer, 
Goldberg, Greenhill and Jerrum [TT] for results about this complexity class). This does, 
however, not rule out the possibility of sampling from the uniform distribution of H- 
colourings of more restricted graphs, such as the n-vertex path, as we will be focusing 
on in this paper. This sampling task may be carried out by simulating some suitable 
random dynamics converging to vr. Ensuring that a dynamics converges to vr is generally 
straightforward, but obtaining good upper bounds on the number of steps required for 
the dynamics to become sufficiently close to tt is a much more difficult problem. Due to 
a lack of theoretical convergence results, scientists conducting experiments by simulating 
such dynamics are at times forced to "guess" (using some heuristic methods) the number 
of steps required for their dynamics to be sufficiently close to the desired distribution. See 
for example Cowles and Carlin [8] for a comprehensive review of some diagnostic tools 
used to empirically determine these convergence rates. By establishing rigorous bounds 
on the convergence rates {mixing time) of these dynamics computer scientists can pro- 
vide underpinnings for this type of experimental work and also allow a more structured 
approach to be taken. 

Analysing the mixing time of Markov chains for if-colouring problems is a well-studied 
area in theoretical computer science. There is a substantial body of literature concerned 
with inventing Markov chains for sampling from the uniform distribution of if-colourings 
of graphs and providing bounds on their mixing times. When an if-colouring corresponds 
to a proper g-colouring of graph with maximum vertex-degree A then Jerrum ^Tj, and in- 
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dependently Salas and Sokal [26], showed that a simple Markov chain mixes in 0{nlogn) 
updates when q > 2 A. This Markov chain makes transitions by selecting a site v and 
a colour c uniformly at random, and then recolouring site f to c if doing so results in 
a proper g-colouring of the graph. By considering a more complicated Markov chain 
Vigoda [27| was able to weaken the restriction on g to g > (11/6) A colours being suffi- 
cient for proving mixing in O(nlogn) updates. This remains the least number of colours 
required for mixing of a Markov chain on general graphs, however the number of colours 
can be further reduced for special graphs. For example, when the underlying graph is the 
square grid then Goldberg, Martin and Paterson [19] gave a hand-proof that q = 7 colours 
are sufficient for mixing in O(nlogn) updates by proving a condition called "strong spa- 
cial mixing". Achlioptas, Molloy, Moore and van Bussel [I] showed that q = 6 colours 
are sufficient for a Markov chain for proper colourings of the grid to mix in 0{nlogn) 
updates using a computational proof. As a final example for proper g-colourings Mar- 
tinelli, Sinclair and Weitz [24] showed that q = A + 2 colours are sufficient for 0{n\ogn) 
mixing when the underlying graph is a tree, improving a similar result by Kenyon, Mossel 
and Peres [22]. When H correspond to independent set configurations of a graph with 
parameter A (that is, the vertex labeled b in Figure [His assigned some positive weight A 
and a has weight 1) then A < is sufficient for O(nlogn) mixing as shown by Dyer 
and Greenhill [16] and independently Luby and Vigoda [23j (although the latter result 
is restricted to triangle- free graphs). When A < 4 these results include the A = 1 case 
which is of special interest to computer scientists since it corresponds to sampling from 
the uniform distribution on independent sets of the graph. Weitz [29] has recently im- 
proved the condition on A to A < (A — 1)^^^/ (A — 2)^ which notably includes the A = 1 
case for A = 5. When A > 6 and A = 1 then Dyer, Frieze and Jerrum [10] have shown 
that there exists a bipartite graph Go such that any so-called cautious Markov chain on 
independent set configurations of Gq has (at least) exponential mixing time (in the num- 
ber of sites of Go). A Markov chain is said to be cautious if it is only allowed to change 
the state of a constant number of sites at the time. This negative result was generalised 
to if-colourings by Cooper, Dyer and Frieze [7]. Their result applies to graphs H that 
are either bipartite or have at least one loop present, and is not a complete graph with all 
loops present (observe that for such an H the decision problem is in P and the counting 
problem is in #P as discussed above). In particular this result guarantees the existence 
of a A-regular graph Go (with A depending on H) such that any cautious Markov chain 
on the set of if-colourings of Go, and with uniform stationary distribution, has a mixing 
time that is at least exponential in the number of sites of Gq. 

While much is now understood about the mixing times of Markov chains, the types 
of Markov chains frequently studied by computer scientists fall under a family of Markov 
chains that we call random update Markov chains. We say that a Markov chain on the 
set of i7-colourings of a graph is a random update Markov chain when one step of the 
the process consists of randomly selecting a set of sites (often a single site) and updating 
the colours assigned to those sites according to some well-defined distribution induced 
by TT. The mixing time of a random update Markov chain is measured in the number of 
updates required in order for the Markov chain to be sufficiently close to tt. We point out 
that all the positive results described above are for random update Markov chains. An 
alternative to random update Markov chains is to construct a Markov chain that cycles 
through and updates the sites (or subsets of sites) in a deterministic order. We call this 
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a systematic scan Markov chain (or systematic scan for short). Systematic scan may 
be more intuitively appealing in terms of implementation, however until recently this 
type of dynamics has largely resisted analysis when applied to if-colouring problems. 
Perhaps some of the first analyses of systematic scan were due to Amit [3j and Diaconis 
and Ram who respectively studied systematic scan in the context of sampling from 
multivariate Gaussian distributions and generating random elements of a finite group. 
The mixing time of a systematic scan Markov chain is measured in the number of scans 
of the graph required to be sufficiently close to vr and throughout this paper it holds that 
one scan takes 0(n) updates where n is the number of sites of the graph. It is important 
to note that systematic scan remains a random process since the method used to update 
the colour assigned to the selected set of sites is a randomised procedure drawing from 
some well-defined distribution induced by vr. This paper is concerned with sampling 
from the uniform distribution of if-colourings of the ra- vertex path using systematic scan 
Markov chains. 

Only few results providing bounds on the mixing time of systematic scan Markov 
chains for sampling from the uniform distribution of if-colourings exist in the literature 
and almost all of them focus on proper g-colourings of bounded degree graphs. For 
general graphs systematic scan is known to mix in O(logn) scans whenever q > 2 A, 
where A is the maximum vertex-degree of the graph, by updating both end-points of an 
edge in each move. This is due to a recent result by Pedersen p5j which improves the 
polynomial in the q = 2A case from a result of Dyer, Goldberg and Jerrum [13] that 
is obtained by updating one site at the time. If the underlying graph is bipartite then 
a systematic scan mixes in 0(log?T,) scans whenever q > /(A) where /(A) (3 A as 
A —> oo and /? ~ 1.76. This result is obtained by a careful construction of the metric 
used in the coupling construction and is due to Bordewich, Dyer and Karpinski [4|. 
Furthermore, Dyer, Goldberg and Jerrum [Hj have shown that a systematic scan for 
proper 3-colourings of the n- vertex path mixes in 6(n^logn) scans when considering a 
systematic scan which updates a single site at the time using the Metropolis update rule. 
In the same paper it is also shown that systematic scan for if-colourings of the n- vertex 
path mixes in O(n^) scans for any fixed H and that a random update Markov chain for 
if-colourings of the n- vertex path mixes in O(n^) updates. The authors suggest, however, 
that both of these bounds are unlikely to be tight and we will significantly improve them 
both in this paper. 

In this paper we prove that systematic scan for if-colourings of the n-vertex path 
mixes in O(logn) scans for any fixed graph H, by updating a constant-size block of sites 
at each step. By constant-size we mean that the number of sites contained in a block is 
bounded independently of n. We do however allow the block-sizes to depend on H (since 
H is a fixed graph). We will present two different Markov chains in order to achieve 
this aim. In Section [2] we show that if if is a graph in which any pair of colours are 
connected by a 2-edge path then a systematic scan mixes for any order of a set of blocks, 
provided that the blocks are large enough. We will use a recent result by Pedersen [25], 
which is based on a technique known as Dobrushin uniqueness, in order to establish the 
mixing time of this Markov chain. In Section [3] we extend this result to all connected 
graphs H, although at the expense of imposing a specific order on the scan. The proof of 
mixing uses path coupling [5] in this case. Finally, for completeness, we give a proof that 
a random update Markov chain for if-colourings of the n-vertex path mixes in 0{nlogn) 
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updates for any fixed graph H. This result is presented in Section [H 



1.1 Preliminaries and statement of results 

Consider a fixed (and connected) graph H = {C,E) with maximum vertex-degree Ah- 
Let C = {1, . . . , g} be referred to as the set of colours. Also let = {1, . . . , n} be the set 
of sites of the n-vertex path and in particular let Vi be the set of sites with odd indices 
and V2 the set of sites with even indices. We formally say that an if-colouring of the 
n-vertex path is a function h from V" to C such that {h{i), h{i + l)) G E for all i G V\{n}. 
Let fl~^ be the set of all configurations (all possible assignments of colours to the sites) 
of the n-vertex path and Q be the set of all if-colourings of the n-vertex path for the 
given H. Define vr to be the uniform distribution on Q. If x G is a configuration and 
j E V is a site then Xj denotes the colour assigned to j in configuration x and for any 
set A C let xa = [Jy^zAi^v} be the set of colours assigned to sites in A. For colours 
c,d & C and an integer / let 0^!^ be the uniform distribution on if-colourings of the 
region of consecutive sites L = {vi, . . . ,vi} C V consistent with site Vi being adjacent to 
a site i E V \ L assigned colour c and site vi being adjacent to a site in \ L assigned 
colour d. Also let Df\{vj) be the distribution on the colour assigned to site Vj induced 



by D^^\. Observe that for s < / 



where Df'^ | f 1 = Ci, . . . , = is the uniform distribution on if-colourings of L condi- 
tioned on site Vi being assigned colour Ci, V2 colour C2 and so on until Vg being assigned 
colour Cg. 

Let M. be any ergodic Markov chain with state space and transition matrix P. By 
classical theory (see e.g. Aldous [2j) has a unique stationary distribution, which we 
will denote vr. The mixing time from an initial configuration x G f2 is the number of steps, 
that is applications of P, required for to become sufficiently close to vr. Formally the 
mixing time of M. from an initial configuration x G f2 is defined, as a function of the 
deviation e from stationarity, by 

Mix^(Al,£) = min{t > : dTv(^*(a;, ■), ^r) < e}, 

where 

dTv(^i,^2) = \y,\0i{T)-e2{i)\ = max \eM) - 02{A)\ 

i 

is the total variation distance between two distributions d\ and di on f2. The mixing time 
Mix(M, e) of M. is then obtained by maximising over all possible initial configurations 

Mix(Al, e) = max Mix^(A<, e). 

We say that M. is rapidly mixing if the mixing time of is polynomial in n and log(£~^). 

We study Markov chains that perform heat-bath moves on a constant number of sites 
at the time. For any configuration x G Q'^ and subset of sites A C we let Qa^x) be the 
set of configurations where the colours assigned to the endpoints of each edge containing 
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a site in A are also adjacent in H. A heat-bath move on A starting from configuration x 
is performed by drawing a new configuration from the uniform distribution on Qj\,{x). We 
would normally let fl be the state space of our Markov chains, however, if H is bipartite 
then we encounter a minor technical difficulty because the Markov chain may not be 
ergodic. We overcome this ergodicity issue by partitioning the state space as follows. 
If Ci and C2 are the colour classes of H then f2i = {x G f2 : xi G Ci} is the set of 
-ff-colourings where the first site of the path is assigned a colour from Ci. Observe that 
in fact each site in Vi is assigned a colour from Ci and each site in V2 is assigned a colour 
from C2. Similarly fl2 = E ■ Xi E Ci} is the set of if-colourings where the first site 
is assigned a colour from C2. Intuitively, fli and Q2 are the two connected components 
of fl and we will show (Lemma [TT]) that the constructed Markov chains are ergodic on 
either fli or ^2. To see that fli U ^2 contain all if-colourings of the n- vertex path it is 
enough to observe that if x E fl then any pair of adjacent sites of the n- vertex path must 
be assigned colours from opposite colour classes of if in x. We let fl^ be the relevant 
state space of the Markov chains in order to ensure ergodicity. In particular, if H is 
non-bipartite then fl^ = Q. Otherwise H is bipartite and we let fl^ be one of fli and 
fl2- This is the same partition used by Dyer et al. in [H]. See also Cooper et al. [7j for 
a discussion of this issue in the context of if-colourings. 

We are now ready to formally define the systematic scan Markov chains we will 
study in this paper and state our theorems. Let h = \Aj^\og{Ajj + 1)] + 1. Then 
let {01, . . . , Qmi} be any set of mi = [ri//i] blocks where each block consists of li con- 
secutive sites and IJfc!^i = V . For each block 9^ we define pt*^' to be the transition 
matrix on the state space flr^ for performing a heat-bath move on 6^. 

Definition 1. For any integer n we let A^AnyOrder be the systematic scan Markov chain, 
on the state space fl^, with transition matrix n^^pl^l 

It is worth pointing out that the following result holds for any order of the blocks, as 
is the case for all results obtained by Dobrushin uniqueness (see e.g. Dyer et al [1^). In 
Section [2] we will use a recent result by Pedersen p5] to prove the following theorem. 

Theorem 2. Let H be a fixed connected graph and consider the systematic scan Markov 
chain A^AnyOrder on the state space fl^. Suppose that H is a graph in which every two 
sites are connected by a 2-edge path. Then mixing time 0/ A^AnyOrder is 

Mix(AiAnyOrder,£) < A^(A| + 1) log(n£-l) 

scans of the n-vertex path. This corresponds to 0{n\ogn) updates by the construction of 
the set of blocks. 

Remark. Note that that each H for which Theorem [2] is valid is non-bipartite so flr^ = Q. 

Remark. Several well known graphs satisfy the condition of Theorem [21 for example 
Widom-Rowlinson configurations, independent set configurations and proper g-colourings 
for g > 3. The fact that an H corresponding to 3-colourings satisfies the condition of the 
theorem is particularly interesting since a lower bound of fl^n"^ logn) scans for single site 
systematic scan on the path is proved in Dyer at al. [Hj. This means that using a simple 
single site coupling cannot be sufficient to establishing Theorem [2] for any family of H 
including 3-colourings and hence we have to use block updates. 
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While many natural if-colouring problems belong to the family covered by Theorem [2l 
others (e.g. Beach configurations) are not included. We go on to show that systematic 
scan mixes in 0(log?T,) scans for any fixed graph H by placing more strict restrictions on 
the construction of the blocks and the order of the scan. Let s = 4g + 1, /3 = [log(2sg'* + 
and I2 = 2(3s. For any integer n consider the following set of m2 + 1 = \2n/l2\ 
blocks {00, ... , Bma} where 

Qk = {kf3s + 1, . . . , min((A; + 2)(3s, n)}. 

We observe that IJfc^o ©fc = V" by construction of the set of blocks. Furthermore note 
that the size of 9^2 is at least /3s and that the size of every other block is I2. 

Definition 3. For any integer n we let A^FixedOrder be the systematic scan Markov chain, 
on the state space Q^, which performs a heat-bath move on each block in the order 

©0, • • • , 0m2- 

In Section [3] we will use path coupling [5] to prove the following theorem, which 
improves the mixing time from the corresponding result in Dyer et al. [H] from O(n^) 
scans to 0(logr;,) scans. 

Theorem 4. Let H be any fixed connected graph and consider the systematic scan Markov 
chain A^FixedOrder on the state space The mixing time 0/ A^FixedOrder is 

Mix(AlFixedOrder,e) < (4sg^ + 2) log(rZ£-l) 

scans of the n-vertex path. This corresponds to 0{n\ogn) updates by the construction of 
the set of blocks. 

Remark. It is worth remarking at this point that Theorem [4] eclipses Theorem [2] in the 
sense that it shows the existence of a systematic scan for a broader family of H than 
Theorem [2] but with the same (asymptotic) mixing time. The result stated as Theorem [2] 
however remains interesting in its own right since it applies to any order of the scan. 
Following the proof of Theorem [2] we will discuss (Observation [H]) the obstacles one 
encounters when attempting to extend Theorem [2] to a larger family of H using the same 
method of proof. 

For completeness we finally consider a random update Markov chain for if-colourings 
of the n-vertex path. Let 7 = 2g* + 1 and define the following set of n + 57 — 1 blocks, 
which is constructed such that each site is contained in exactly sj blocks 

J {fc, . . . , min(fc + 57 — 1, n)} when /c G {1, . . . , n} 

. . . ,n + S'j — k} when /c G {n + 1, . . . , n + 57 — 1}. 

Definition 5. For any integer n we let AIrnd be the random update Markov chain, on 
the state space fl^, which at each step selects a block uniformly at random and performs 
a heat-bath move on it. 

In Section [4] we will use path coupling [5] to prove the following theorem, which 
improves the mixing time from the corresponding result in Dyer et al. [14] from 0{n^) 
updates to O(nlogn) updates. 
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Theorem 6. Let H be any fixed connected graph and consider the random update Markov 
chain tMrnd on the state space Q^. The mixing time 0/ TWrnd is 

Mix(MRND,^) < ^ ' ^ 

block-updates. This corresponds to 0{n\ogn) updates since the size of each block is at 
most S7 = 0(1). 

1.2 Review of proof techniques 

We now briefly introduce the techniques we will use to bound the mixing time of the 
above Markov chains. For technical reasons we extend the state space of the Markov 
chains as follows. Let Qf be the set of configurations where each site in Vi is assigned a 
colour from Ci and each site in V2 is assigned a colour from C2 (recall that Ci and C2 are 
the colour classes of H) . Similarly, is the set of configurations where each site in Vi 
is assigned a colour from C2 and each site in V2 is assigned a colour from Ci. Formally 

= {x eQ^ : C Ci, xv^ C C2} 

and 

fi^ = {a; G fi+ : Xy, C C2, Xy^ C Ci}. 

We then extend the state space of the Markov chains to where fi+ = fi+ if R is 
not bipartite and '^t. is one of fi^ or ^2 when if is bipartite. The extended Markov 
chains make the same transitions as the original Markov chains on configurations in 
fi^ and hence the extended chains do not make transitions from configurations in fi^ 
to configurations outside fi^. The stationary distributions of the extended chains are 
uniform over the configurations in Vt^ and zero elsewhere. This approach is standard 
and the mixing times of the original chains are bounded above by the mixing time of 
corresponding chain on the extended state space. 

For each site j G V", let Sj denote the set of pairs (x, y) G f2+ x f2+ of configurations 
that only differ on the colour assigned to site j, that is Xi = yi for all i j. Also let 
S = Ujev °f such pairs of configurations. 

1.2.1 Dobrushin uniqueness 

We will make use of a recent result by Pedersen [25] to prove Theorem [2] by bounding 
the infiuence on a site. For completeness we now summarise this result and at the same 
time point out how the construction of A^AnyOrder ensures that all required properties are 
satisfied. First note from the remark after Theorem [2] that each H that we consider is 
not bipartite and so f2+ = . Suppose that {9i, . . . , 9^} C ^ is a set of m blocks such 
that UfcLi = V and that each block 9^ is associated with a transition matrix P''^! on 
the state space For any configuration x G ^l'^ , P^^^{x, ■) denotes the distribution on 
configurations obtained from applying P^'^^ to x. Recall from the definition of A^AnyOrder 
that the set of blocks covers V as required and that each transition matrix P''^! represents 
performing a heat-bath move on 9^. It is furthermore required that each transition 
matrix P''^! satisfies the following two properties. 



9 



1. If p['^](a;, y) > then Xi = yi for each i E V \ 6^, and 

2. the distribution vr on fi"*" is invariant with respect to Pl'^l 

Pedersen [25] points out that if Pt^l is the transition matrix performing a heat-bath move 
on 0fc and tt is the uniform distribution on f2, as they both are in the case of AlAnyOrder, 
then both of these properties are satisfied. These properties ensure that the stationary 
distribution of any systematic scan Markov chain with transition matrix II^j^P''^] is n. 

We are now ready to define the parameter a denoting the influence on a site. For any 
pair of configurations {x,y) E Si let be a coupling of the distributions p[^l(x, ■) 

and P^''^{y, ■). We remind the reader that a coupling of two distributions tti and 712 on 
state space is a joint distribution on x such that the marginal distributions 
are tti and 112. We let {x',y') G ?/) denote that the pair of configurations {x',y') is 

drawn from the coupling \l/fc(x,y). We then let 

pi,j = Pvy)e*fe{^,j/)(4 ^ y'j) 

be the influence of site i on site j under 9^. The influence of i on j is thus the probability 
that site j G Qk is assigned a different colour in a pair of configurations drawn from the 
coupling '^k{x,y) where x and y differ only on the colour of site i. Finally the parameter 
a denoting the infiuence on any site is defined as 

a = max max > of . . 

Remark. Pedersen [25] actually defines a with a positive weight assigned to each site of 
the graph, however as we will not use the weights in our proof they are omitted from the 
above definition. 

The following theorem bounds the mixing time of a systematic scan Markov chain 
M.^ with transition matrix Yi^^^P^'^\ It is worth pointing out that, since the proof 
makes use of Dobrushin uniqueness, this upper-bound on the mixing time holds for any 
order of the blocks. 

Theorem 7 (Theorem 2, Pedersen [25]). If a < 1 then the mixing time of M.^ satisfies 

1 — a 

1.2.2 Path coupling 

In order to prove Theorems H] and [6] we will make use of path coupling [5] which is a well- 
known, and by now standard, technique for proving rapid mixing of Markov chains. The 
key idea of path coupling is to define a coupling for pairs of adjacent configurations where 
the set of all adjacent configurations connects the state space. We will say that a pair of 
configurations x,y E are adjacent if {x,y) E S. The path coupling machinery then 
extends the coupling to all pairs of configurations in the state space. For completeness 
we show that S connects the state space fl^. 

Lemma 8. The transitive closure of S is the whole o/^7;t x 1]+. 



10 



Proof. Recall that S = Uiev where 5*^ C f2+ x is the set of pairs (x, y) G x 
of configurations that differ only on the colour assigned to site j. To establish the lemma 
it is sufficient to, for any pair of configurations {x,y) G x ^i, to construct a path 
X = z'^ , , . . . , z"' = y such that {z^~^, z^) G Sj for each j G {!,..., n}. We define 2;-' for 
j G {1, . . . , n} as follows 

^ \yi for 1 < z < j 
* I Xj for j < z < n. 

Informally, configuration z^ agrees with configuration y from site 1 to j and with config- 
uration X from site j + 1 to n. 

By definition of the configurations z", . . . , n"^ it follows that z^~^ and only differ on 
the colour assigned to site j for each j G {1, . . . , ra}. Hence we only need to check that 
z^ G VL'^ for each j. U H is non-bipartite then fi+ = i7+ so z-' G n+ for each j G {1, . . . , n}. 
If if is bipartite then fl^ is one of or fij. Suppose without loss of generality that 
fl^ = fli- Then for each j G {1, . . .n} it holds by definition of flf that the colours Xj 
and yj must be from the same colour class of H and hence have z^ G Qf. □ 

Finally note that Ham(x, y) = 1 for any (x, y) E S where Ham(x, y) denotes the 
Hamming distance between configurations x and y. The following theorem is sufficient 
for our needs in this paper, and it is a special case of the general path coupling theorem 
proved by Bubley and Dyer [5]. 

Theorem 9 (Bubley, Dyer [5]). For all pairs {x,y) G S define a coupling {x,y) 1— > {x',y') 
of a Markov chain Ai on the state space Vf^. Suppose that there exists a constant < 
7 < 1 such that E [Ham(x', y')] < (1 — 7) for all pairs {x,y) G S. Then the mixing time 
of Ai satisfies 

Mix(M,e) < — -. 

7 

2 //-colourings on the path for a restricted family of H 

Recall that denotes the maximum vertex-degree of some fixed graph H and that 
li = [A|^log(A|f + 1)] + 1. The systematic scan Markov chain A^AnyOrder on has 
transition matrix E^j^P'*'' where P^'^' is the transition matrix for performing a heat-bath 
move on block 6^ from a set of mi = \n/li \ size h blocks covering the n- vertex path. We 
will prove Theorem [2], namely that A^AnyOrder mixes in O(logn) scans when H is a graph 
in which any two colours are connected via a 2-edge path. We will bound the mixing 
time of A^AnyOrder by bouudiug the influence on a site and begin by establishing some 
lemmas required to construct the coupling needed in the proof of Theorem [2l 

Lemma 10. Suppose that for any Ci,C2 G C there is a 2-edge path in H from Ci to C2. 
Then for any ci,C2,d G C and integer s' > 2 there exists a coupling ip{D^_^ ^, D^^ ^) of 
Z^^^ \ and D^^^ \ such that 

(') P^.'y)e^(</„<,i)(<^ ^ <^ < 1 - 4 and 
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Proof. By the condition of the lemma there exists some c' G C adjacent to both ci and 
C2 in H. We prove the statement by considering two cases on s. 

First suppose that s' = 2. By the condition of the lemma there is some colour d' 
adjacent to both c' and d in H. There are at most valid iZ-colourings of the sites 

(2) (2) 

Vi,V2 in either of the distributions -D^.^^ and -D^^^, and hence the colouring h, which 
assigns c' to Vi and d' to V2, has weight at least 1/A|^ in both. We construct a coupling 
such that 



The rest of the coupling is arbitrary. This gives the following bounds on the disagreement 
probabilities at Vi and V2 

which establishes (i) for s' — 2 and 

1 



which establishes (ii). 

Now suppose s' > 2. Let adj{c) denote the set of colours adjacent to c in and 
Uk the number of if-colourings on the sites ^4, . . . , v^/ consistent with ^3 being assigned 
colour k & C and Vs' being adjacent to a site (out side the block) coloured d. Also let Pc,k 
be the number of iJ-colourings oi Vi,V2,Vs assigning colour c to Vi and k to Vs without 
regard to other sites. Finally let Zi be the number of i7-colourings with positive measure 
in -De*] '^^^^^ assume without loss of generality that zi > Z2- 

There are at most Ah colours available for each site in the block which gives Pc,k < 
for any c,k & C and hence 

ceadj{ci) keC ceadj{ci) keC kec 

Now let H{d) be the set of all if-colourings with positive measure in D^^_^ \ that assign 
colour c' to site vi. Let h{c!) denote the size of this set. Now Pc,k > 1 for any c,k & C 
since there is a 2-edge path in H between any two colours and hence 



^(•^0 = ^Pc',knk > y^Wfc. 



fcec fcec 



Observe that, for any h G Hie'), h is at least as likely in -D)"* ,\ as in -D^i d since we 
have assumed zi > Z2 without loss of generality. We construct a coupling 'ip{D^^_^ \, D^^ 
of \ and \ in which for each h e H{c!) 

1 
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The rest of the coupling is arbitrary. Hence 



heH(c') 
^ h{c') 



> 



1 



using the bounds on Zi and /;,(c'). This completes the proof. □ 

We then use Lemma [lO] to bound the disagreement probabilities at each site of of the 
block when a pair of configurations are drawn from a recursively constructed coupling. 

Lemma 11. Suppose that for any ci,C2 G C there is a 2-edge path in H from c\ to C2. 
Then for all Ci,C2,d G C and integers V > 2 there exists a coupling '^{D),^\j^, D),^^ of 

D^^ ^ and D^^ ^ in which for j G {1, . . . , Z' — 1} 
and 



H 



I'-i 



Proof. We recursively construct a coupling '^{D),^'^, D).^'^ of -D^i^ ^cjd using the 
method set out in Goldberg et al. [Ej as follows. Firstly /' = 2 is the base case and we 
use the coupling from Lemma [lOl For /' > 3 we construct a coupling using the following 
two step process. 

1. Couple D),_^ and D),J^{vi) greedily to maximise the probability of assigning the 
same colour to site Vi in both distributions. 

2. If the same colour c was chosen for Vi in both distributions in step 1 then the set of 
valid if-colourings of the remaining sites are the same in both distributions. Hence 
the conditional distributions -D^i d I = c and -0^2 d I = ^ are the same and 
the rest of the coupling is trivial. Otherwise, for all pairs {c[, Cg) of distinct colours 
recursively couple -D^i d I ~ '^'i ~ -^c' ~d^^ -^cl I "^i = ^2 = -D^! which is a 
sub problem of size I' — 1. 

This completes the coupling construction. 

Now for j G {1, — 1} we prove by induction that 
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The base case, j = 1, follows from Lemma [TOl since we couple the colour at site vi greedily 
to maximise the probability of agreement at vi in the first step of the recursive coupling. 
Now suppose that ([I]) is true for j — I then 



< 1 



A2 



where the first inequality uses Lemma fTOl and the second is the inductive hypothesis. 
The j = I' case is similar. 



r' r' 

X Pr 



{x' ,y')£-f{D'^f d'^^ 7^ ?/f2) 



1 \ ''^^ / 1 \ / 1 ^ '''^^ 



< 1- ^ 1 



A2 ( \ - A 2 ; \ - A 2 

where the inequality uses Lemma fTOl and □ 

We can then use the coupling constructed in Lemma [H] to construct a coupling 
y) of the distributions P^''^(x, ■) and P^''^{y, ■) for each pair of configurations (x, y) E 
Si. We summarise the disagreement probabilities in this coupling in the following corol- 
lary (of Lemma [TTl) . 

Corollary 12. For any sites i,j G V let d{i,j) denote the edge distance between them 
and suppose that for any c,d E C there exists a 2-edge path in H from c to d. Then 

( 1 

(1 — if i is on the boundary of 9^ and d{i,j) < h 

^1 — if i is on the boundary of 0^ and d{i,j) = li 

otherwise. 
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Figure 4: A block Qk of length li. 







h -dj + l 










j 





Proof. For each block 9^ we need to specify a coupling 2/) of the distributions 

P^''^(x, ■) and P^''^{y, ■) for each pair of configurations {x, y) e Si and each i eV . Trivially 
if z G 0fc then the set of if-colourings with positive measure in each distribution is the 
same and the same if-colouring can be chosen for each distribution. The same holds 
when i is not on he boundary of 0^. 

Suppose that i is on the boundary of 0^. Let the other site on the boundary of 0^ 
be coloured d in both x and y and hence p['^l(x, ■) = D^l^\ and P^^\y, ■) = Dyfd- t^e^ 

let \E'fc(a^,l/) = ^ {D'^xl!di ^yl^d) which is the coupling constructed in Lemma [TT] and gives 
the stated bounds on the disagreement probabilities. □ 

Remark. It is important to note that, given distinct sites i and i' both on the boundary 
of 0fc, we may use a different coupling for p^j and pf/ j. This is the case since, by definition 
of p, the coupling may depend on both the block and the two initial configurations x and 
y (which in turn determine i). Since x and y only differ on the colour assigned to site 
the coupling is defined to start from the site in 0^ immediately adjacent to i, and thus 
we can use a different coupling for p^j and pf, ■ . 

The following technical lemma is required in the proof of Theorem [2l 

Lemma 13. For any < p < 1 and j, I G Z+ where I > 2j 

Proof. 

pi + pi-j+i _ pj+i _ pi-j = _ _ pi-j(^i _ 

= [pi - pi-^){l - p) 
= p'{l-p^-^^){l-p) > 

since < p < 1 where the last equality uses the fact I > 2j. □ 

We are now ready to prove Theorem [21 

Proof of Theorem [3 We will show that a < 1 and then use Theorem [7] to obtain the 
stated bound on the mixing time. Consider some site j G 0fc and let dj denote the 
number of edges between j and the nearest site z ^ 0^ on the boundary of 0^. Then the 
distance to the other site, i', on the boundary of 0^ is li — dj + 1 as shown in Figure [H 
Notice that dj < [/i/2]. By Corollary [12] we have 

ph<[^-^) -dpf,,<i.^,,,^i-^j +'^-^l'-A^j ■ 
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k , ' 



Now let 

be the influence on site j. Then 



a = max max a,- k 

k jGOfe 



< max < max < \ I - -nr ) +1-^ ^,1-^ + 1 



Since dj < [/i/2] the conditions of Lemma [I3] are satisfled for 2 < dj < [/i/2] — 1. In 
particular taking dj = [/i/2] — 1, which satisfies the requirements, gives 



1-^1 +(1-— 1 >|1-— 1 + 

and hence 



max + ^< i__ + 



2 / \ h-l 



< 1-^ + 1 



h-i 



which gives 



a < 1 - ^ + 1 



1 / 1 



1-^+1 



A|, V K 
1 1 

< 1 - ^ + 



rA|,log(A|,+l)l 



1 



1 

aMaTTT) 



by substituting the definition of li and using the fact (1 — l/xY < e ^ for a; > 0. The 
statement of the theorem now follows by Theorem [7l □ 

We now take a moment to show that we are unable to use Theorem [7] to prove rapid 
mixing for systematic scan on if-colourings of the n- vertex path for any H that does not 
have a 2-edge path between all pairs of colours. This motivates the use of path coupling 
(at the expense of enforcing a specific scan order) in the subsequent section. 

Observation 14. Let H = (C, E) be some fixed and connected graph in which there is 
no 2-edge path from C\ to C2 for some distinct Ci,C2 G C. Then for any set of m blocks 
with associated transition matrices P'^^ . . . P'™"] and any coupling '^k{x, v) for 1 < k < m 
and {x, y) G Si we have a > 1 in the unweighted setting. 
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Proof. Recall Si C fi+ x Q'^ where fi^ is the set of all configurations (except when H is 
bipartite in which case fi^ is one of Qf and fi^ as described earlier). Note in particular 
that any given configuration in ^2;^; need not be an if-colouring of the n- vertex path. Also 
recall that p^ j is the maximum probability of disagreement at j when drawing from a 
coupling starting from two configurations {x,y) G Si. Let x be any proper if-colouring 
with Xi = ci and y be the configuration with yj = xj for j i and yi = C2 (If H is 
bipartite then C2 is from the same colour class of if as ci). Note that y is not a proper 
if-colouring as both edges (yj_i,?/j) ^ E and (?/i,yj+i) ^ E, otherwise the 2-edge paths 
(xj,Xj+i = yi^i,yi) and (xj,Xj_i = yi-i,yi) would exist in H. However, x and y are both 
configurations in f2+ and they only differ at the colour of site i so (x, y) is a valid pair in 
Si. 

Now assume that a < 1. Fix some block 9fc = {i + 1, . . . ,i + 1} of length / and 
let Pl'^l be the transition matrix associated with 0^. Also let 'i!k{x,y) be any coupling 
of p['^l(a;, ■) and p['^l(?/, ■). Since a < 1 it must hold that p^j < 1 for each j G 6^. In 
particular pfj+i = Pi^{x' ,y')e'S'k(x,y){x[_^_-^ ^ Vi+i) < 1 and so (letting adj{c) denote the set 
of colours adjacent to c in H) the set adj{ci) fl adj{c2) must be non-empty since there 
is a positive probability of assigning the same colour to site i + 1 in both distributions. 
However take any d G adj{ci) fl adj{c2), then (ci,(i, C2) is a 2-edge path from ci to C2 in 
H contradicting the restriction imposed on H and hence a > 1. □ 

Remark. It remains to be seen if adding weights will allow a proof in the Dobrushin 
setting for classes of H not containing 2-edge paths between all colours. However, this 
can be done using path coupling as we will show in section [3l 

3 //-colouring on the path for any H 

Recall that A^FixedOrder IS the systematic scan on defined as follows. Let s = 4g + 1, 
P = [log(2sg'' + l)]q^ and I2 = 2(3s. Then A^FixedOrder is the systematic scan which 
performs a heat-bath move on each of the m2 + l = \ 2n/l2\ blocks in the order 9o, . . . , 
where 

= {k(3s + 1, . . . , min((A; + 2)(3s, n)}. 

Note that the size of is at least (3s and that every other block is of size I2. We will 
prove Theorem m which bounds the mixing time of A^FixedOrder- Our method of proof will 
be path coupling [5] and we begin by establishing some lemmas required to define the 
coupling we will use in the proof of Theorem [H The constructions used in the following 
two lemmas are similar to the ones from Lemma 27 in Dyer et al. |14| . 

Lemma 15. If H is not bipartite then for all Ci,C2 G C there is an s-edge path in H 
from Ci to C2. 

Proof. Let c G C be some site on an odd-length cycle in H and let di be the shortest 
edge-distance from ci to c and d2 the shortest edge-distance from c to C2. We construct 
the path as follows. Go from Ci to c using di edges. If di + d2 is even then go around the 
cycle using an odd number q' < q of edges. Go from c to C2 in ^2 edges and observe that 
the constructed path is of odd length. Also the length of the path is at most 

di + d2 + q' < 3q. 
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Finally go back and forth on the last edge on the path to make the total length s. □ 

Lemma 16. // H is bipartite with colour classes Ci and C2 then for all ci G Ci and 
C2 € C2 there is an s-edge path in H from C\ to C2. 

Proof. Go from ci to C2 in at most q — l edges and note that the number of edges is odd. 
Then go back and forth on the last edge to make the total path length equal to s. □ 

For completeness we present a proof that FixedOrder is ergodic on Q^. 

Lemma 17. The Markov chain FixedOrder is ergodic on fi^. 

Proof. Let -PFixedOrder be the transition matrix of FixedOrder- We need to show that 
TWpixedOrder Satisfies the following properties 

• irreducible: -PpixedOrderl^; ?/) > f^"^ ^^^^ P^^^ {^jU) ^ ^ ^ind some integer 
t > 

• aperiodic: gcd{t : -PpixedOrderl^' ^) > 0} = 1 for each x E ^l^. 

In an application of ^FixedOrder a heat-bath move is made on each block in the order 
60, ... , Qm- A heat-bath move on any block starting from an if-colouring has a positive 
probability of self-loop which ensures aperiodicity of the chain. To see that A^FixedOrder 
is irreducible consider any pair of if-colourings G fi^ x Q^. We exhibit a sequence 

of if-colourings x = cr°, . . . , o"™^"'"^ = y such that = a^^^ for each < /c < m2 and 
i eV \ Qk- Using this sequence we observe that -PFixedOrder(a;, y) > since, for each < 
k < m2, performing a heat-bath move on block to cr^ G results in the if-colouring 
cr'^"'"^ G flr^ with positive probability. Recall that 9^ = {k^s + 1, . . . , min((A; + 2)/5s, n)}. 
Then let a'' be given by 

if 1 < z < min((fc + 2)ps - s + l,n) 
if {k + 2)(3s + l <i<n 
if {k + 2)f3s -s + 1 <i< mm{{k + 2)(3s, n) 

where p{j) is the j-th in the sequence of colours on the s-edge path in H between p{0) = 
y{k+2)i3s-s+i and p{s) = X(^k+2)/3s+i given by Lemmas [15] and [16] (since p{0) and p{s) are in 
opposite colour classes of H in the bipartite case) respectively. □ 

The following lemma is an analogue of Lemma 13 in Goldberg et al. 1 1 9] . 

Lemma 18. For any ci,C2,d G C and positive integer s > s such that both -D^i d 
-^C2 d ^'"^ non-empty there exists a coupling ip{D^^^\, d'"^^^ ^/-^cii -^ca c« that 

Proof. For ease of notation let Di denote -0^^ and D2 denote -Dc2,d- ^'^^ ^' ^ 
be the number of iY-colourings on Vs+i, ■ ■ ■ ,Vs' consistent with Vg being assigned colour 
k E C and Vgi adjacent to a site (not in L) coloured d. If both s' = s and k is adjacent to 
d in H then = 1. If s' = s but k is not adjacent to d in H then = 0. The following 



p{i-{k + 2)ps + s-l] 
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definitions are for i G {1,2}. Let li{k) be the number of if-colourings on Vi,...,Vs 
assigning colour k to site Vs and consistent with t>i being adjacent to a site (not in L) 
coloured q. We also let Zi be the set of if-colourings on L with positive measure in Di 
and Zi be the size of this set. Note that Di is the uniform distribution on Zi so for each 
X & Zi Ptjj.{x) = 1/zi. For each k & C let Zi{k) C Zi be the set of if-colourings with 
positive measure in Di that assign colour k to site Vs and let Zi{k) be the size of this set. 
Note that li{k)nk = Zi{k) and J2k^ii^) ~ ^i- Let C* = {k ^ C \ Zi{k) > 0} be the set of 
valid colours for Vs in A and let C* = Q U 

We define a coupling ip of -Di and D2 as follows. Assume without loss of generality 
that Zl > Z2. We create the following mutually exclusive subsets of Zj. For each k G 
C* let f{k) = min(zi(A;),Z2(fc)) and let Fi{k) = {a(^)(l), . . . , ct(^)(/(A;))} C Zi{k) be 
any subset of if-colourings in Zi assigning the colour k to site Vg- Also let F2{k) = 
{t^''\1), . . . , r'^'^^f (k))} C Z2{k) and observe that Fi{k) and F2{k) are of the same size. 
We then construct ip such that for each k E C* and j G {1, . . . , f{k)} 

Pr(.^,oe^(x' = a(^)(j),2/' = rW(j)) = -- 

The rest of the coupling is arbitrary. For example let Ri = Zi \ (IJfcgc* -^i(^)) 
of (valid) if-colourings not selected in any of the above subsets of Zi and the size of Ri 
be Tj, observing that ri > r2. Let R'l = {(t(1), . . . ,a{r2)} C Ri and enumerate R2 such 
that R2 = {r(l), . . . , r(r2)}. Then for 1 < j < ra let 

Pr(x'y)e^(^' = ^(j)>l/' = ^(j)) = — • 
Finish off the coupling by, for each pair {a E Ri\ R'i,t E Z2) of if-colourings, letting 

Pr(^/y)g^(a;' = (r,y' = t) = ^— . 

Z1Z2 

From the construction we can verify that the weight of each colouring x E Zi in the 
coupling is 1/zi and the weight of each colouring y E Z2 is 

1 _^ Zl - Z2 _ 1 
Zl Z1Z2 Z2 

since the size of -Ri \ R[ is Zi — Z2. This hence completes the construction of the coupling. 
We will require the following bounds on li{k) for each k E C* 

1 < k{k) < q\ (2) 

There are at most q colours available for each site in the block and hence at most valid 
if-colourings of Vi, . . . ,Vs which gives the upper bound. We establish the lower bound 
by showing the existence of an s-edge path in H from both Ci and C2 to any k E C*. 
Suppose that H is non-bipartite, then Lemma [15] guarantees the existence of an s-edge 
path in H between any two colours in H, satisfying our requirement. 

Now suppose that H is bipartite with colour classes Ci and C2. Without loss of 
generality suppose that ci G Ci. Since both Di and D2 are non-empty there exists a 
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(2s' + 2)-edge path in H from ci to C2 (via d) so C2 G Ci. Let k E C then k E C2 since 
there is an s-edge path in H from ci to /c and s is odd. Lemma fT6l implies the existence 
of an s-edge path between each c G Ci and each k E C2 which establishes (El). 
Using (I2I) to see that < f{k) < q^n^ for each k E C* we have 

k&C* 

fik) 



> 



E 



kec* ^ 



1 

which completes the proof. □ 

Lemma 19. For any Ci, C2, d E C and any positive integer I' < I2 such that both ^ and 
^C2 d ^'^^ non-empty there exists a coupling ^ of D),^ \^ and D^J^ in which for 1 < j < I' 



Proof. We construct a coupling \1/(Z}^^ ^, D^^) of -D);^ ^ and -0^2 d using the following two 
step process, based on the recursive coupling in Goldberg et al. [T9] . 

1. If /' < s then couple the distributions any valid way which completes the coupling. 
Otherwise, couple D);^ ^(ws) and D^j^lvs) greedily to maximise the probability of 
assigning the same colour to site Vs in both distributions. Then, independently in 
each distribution, colour the sites Vi, . . . ,Vs-i consistent with the uniform distri- 
bution on if-colourings. Note that it is possible to do this since we obtained the 
colour for site Vg in each distribution from the induced distribution on that site. If 
I' = s this completes the coupling. 

2. If the same colour is assigned to Vg then the remaining sites can be coloured the same 
way in both distributions since the conditional distributions are the same. Other- 
wise, for all pairs {c[, c'2) of distinct colours the coupling is completed by recursively 



constructing a coupling of D). ^ | f s = c'l 



D^J^ and 



^co.d \ — (^2 



This completes the coupling construction and we will prove by strong induction that for 
JG{1,...,/'} 

/ 1 \ J 

p^.'y).>i'(<;..<> ) ^y'^^^^y^-js) ■ (3) 

Firstly the cases 1 < j < s — 1 are established by observing that [j/sj = and the 
probability of disagreement at any site is at most 1. The case j = s is established in 
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Lemma [m Now for s < j < I', suppose that ([3]) holds for all positive integers less than 
j. Let S- = {s, 2s, ... } and define the quantities j_ and aj by j_ = max{x E \ x < 
j} = ajs observing that I < j — j- < s. Now 



Observe that for any pair {c[,C2) of colours, if the probabilities of assigning c'^ to Vj_ in 
-D^^^l^ and to fj_ in -D^l d ^'^^ both non-zero then the distributions -D^! and D^! 
are both non-empty and hence, using Lemma [IE for /' — j„ > s and upper-bounding 
probability of disagreement by one otherwise, we get 



< 



4) (V,-=s(i-i/g1 + i.w-^s) 



< 



if J -j- = s 
if J - J- 7^ s 



(4) 



where last inequality is the inductive hypothesis since j_ < j. 

First consider the case j — j_ 7^ s in which we have j- + b = j for some 1 < 6 < s — 1. 
Then 

— T ttjS — 1 

= = aj — 1 < aj 

and so for 1 < 6 < s — 1 



ajS 




j- 


- S - 




s 



j- + b 




j- 


s 




s 



which implies that 



3- 




i 


s 




s 



Now suppose j — j- = s which substituting for j_ gives 

1. 



j- 




j -s 




i 


s 




s 




s 



Substituting ([5]) and ((6|) in ([4]) completes the proof. 



(5) 

(6) 
□ 
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We are now ready to define the coupling of the distributions of configurations obtained 
from one complete scan of the Markov chain A^FixedOrder- The coupling is defined for pairs 
{x,y) G Si. We will let {x',y') denote the pair of configurations after one complete scan 
of A^FixedOrder Starting from (x, y) and let (x^, y^) be the pair of configurations obtained 
by updating blocks Qq, . . . ,Qk-i starting from {x,y) = (x^,?/"). Observe that {x',y') is 
obtained by updating block 9^2 from the pair {x"^^,y"^^). 

The coupling for updating block 6^ is defined as follows. Let i and i' be the sites on the 
boundary of 6^. The order of the scan will ensure that at most one of the boundaries is a 
disagreement in (x*^, y'^), so we only need to define the coupling for boundaries disagreeing 
on at most one end of 9^; suppose without loss of generality that xf, = yf, = d for some 
d & C. Firstly, if xf = y^ then the set of valid configurations arising from updating 9^ is 
the same in both distributions and we use the identity coupling. 

Otherwise xf ^ y^. If H is not bipartite then Lemma fTSl implies the existence of a 
(m2 + l)-edge path between both xf and d and between yf and d. If H is bipartite then 
xf and are in the same colour class but d is in the opposite colour class of H since I2 
is even. Lemma [TBI implies the existence of a (m2 + l)-edge path between both xf and d 
and between yf and d. Hence both distributions D^^k\ and D^k\ are non-empty and we 

obtain {x^^-^ , y^'^^) from {D^''^^ , D^'''^^ ) which is the coupling constructed in Lemma [T9l 

Note that if = m2 (i.e. the block is the last block which may not be of size I2) then both 
distributions remain (trivially) non-empty. For ease of reference we state the following 
corollary of Lemma [T9l 

Corollary 20. For any two sites v,u G V let d{v,u) denote the edge distance between 
them. For any block 9^ let i and i' he the sites on the boundary of 9^ and suppose that 
^i' = Vi' = d for any d E C. Obtain {x'''^^ , y^^^) from the above coupling. Then for any 
J e 9fc 



otherwise. 



Lemma 21. For any positive integers s, k,x 

sk 



Proof. 



^ V X 



< sx. 



sk / l\\.~s\ fe-1 / -, \ 7 / 1 \ fe 



XI \ >' / \ ry I ^^^^^ \ np 

j=\ ^ / \ / \ / \ 

□ 

The following lemma implies Theorem [4] by Theorem [9] (path coupling). 

Lemma 22. Suppose that (x, y) G Si and obtain (x', y') by one complete scan o/A^FixedOrder- 
Then 

E [Ham(x', 1/')] < 1 - - — ^ . 
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Figure 5: Site i is on the boundary of 6a and is not contained in any block 6a' with 
a' < a. 



af3s + 1 



(a + 2)/3s 



Proof. First suppose that i is not on the boundary of any block and that 6^ is the 
first block containing i. In this case Corollary [20l gives us Pr(x^"'"^ 7^ vt^^) = and so 
Ham(x', y') = 0. 

Now suppose that i is on the boundary of some block 6a. Recall the definition of a 
block 

6fc = {k(3s + 1, . . . , mm{k(3s + 2(3s, n)}. 

If i is also contained in a block 6a' with a' < a then Corollary [20] gives Pr(xf+^ ^ 
Vi = ^iid hence Ham(x', y') = 0. 

If site i is not updated before 6a then z = (a + 2)/3s + 1 as shown in Figure [5] and the 
disagreement percolates through the sites in 6a during the update of 6a. Using Corollary 
[20I we have for j G 6a 

in particular, the sites in 6a \ 6a+i = {a(3s + 1, . . . (a + will not get updated again 

during the scan and hence for j G 6a \ 6a+i 

y {a + 2)0s + l-j ^ 

PK4^%)<(l-^) ^ • (8) 

Now consider the update of any block 6^ from the pair of configurations {x^^y'^) 
where k > a. There cannot be a disagreement at site (/c + 2)/3s+ 1 since that site has not 
been updated (and it was not the initial disagreement) so the only site on the boundary 
of 6fc that could be a disagreement in {x^,y'') is k^s. Hence from Corollary [20l for 
j G {k(3s + 1, . . . , min((A; + 2)/3s, n)} 

Pr(xf ^ ^ ^ I xl,, ^ yt,,) < (1 - ^) ^ ■ (9) 
We show by induction on k that for a + 1 < A; < 1712 

/ 1 \ f3{k-a) 

The base case, /c = a + 1 follows from ([7]) since j = kfis = (a + l)/?s = a/?s + /5s G 6a. 
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Now suppose that f fTOl ) is true for /c — 1. Then 

(fc-l)/3s 




using the inductive hypothesis and ([9]). 

Now for each site j > (a + + 1, that is site j is updated at least once following 
block 9a, write j = kj(3s + hj with 1 <hj < (3s where kj denotes is the index of the block 
in which j is last updated. 

Pr(x;. ^ y'^) = Pr(xf ^ yf^') 

We can then apply ((91) to the first component of the product since j G {fcj/^s+l, . . . , min(/cj/3s+ 
2/?s, n)} and (fTOll to the second since a + 1 < j < m2 to get 

p.(4^y;)<(i-^) (i-^j . 

Then, using linearity of expectation and (l8|), we have 
E[Ham(x',y')]=5ZPrK-^y;) 



i6ea\ea+i i6Ufc>a+iefe 




24 



where the last inequality uses Lemma ED and the sum of a geometric progression. Sub- 
stituting the definition of /3 and using the fact (1 — 1/xY < for a; > we get 



E[Ham(x',i/')] < (l"- ^q' + ^ ' 



riog(2sg^+l)lg'' 

sq^ 



so* sq'^ 



griog(2sq-+l)l ' griog(2sg=+l)l(^]^ _ g-riog(2sg=+l)r 



sq ^ sq 



griog(2sq-+l)l griog(2sg=+l)l _ 1 

^ sq^ ^ sq^ 



2sq' + 1 2sq' 

1- ' 



Asq' + 2 

which completes the proof. □ 



4 //-colouring using a random update Markov chain 

Recall that the random update Markov chain A^rnd on fl^ is defined as follows. We 
again let s = 4g + 1 and we define 7 = 2q^ + 1. We then define a set of n + S7 — 1 blocks 
of size at most 57 as follows. 

J {k, . . . , min(A; + 57 — 1, n)} when k G {1, . . . ,n} 

. . . ,n + S'j — k} when k E {ra + 1, . . . , n + 57 — 1} 

By construction of the set of blocks each site is adjacent to at most two blocks and 
furthermore each site is contained in exactly S7 blocks. One step of A^rnd consists of 
selecting a block uniformly at random and performing a heat-bath update on it. We will 
prove (using path coupling) Theorem [6] namely that A^rnd mixes in O(nlogn) updates 
for any H. 

We begin by defining the required coupling. For a pair of configurations (x, y) G Si 
we obtain the pair {x',y') by one step of A^rnd- That is we select a block uniformly at 
random and perform a heat bath move on that block. We can again use Lemma [TOl from 
Section [3] to construct the required coupling for updating block 9^ since the definition 
of s is the same in both Markov chains. If i is not on the boundary of 0^ then the sets 
of valid if-colourings of 9^ are the same in both distributions and we use the identity 
coupling. If i is on the boundary of 0^ then we let the other site on the boundary be 
coloured d in both x and y. We then obtain {x',y') from \l/(Z)^'^^j, D^*^'') which is the 
coupling constructed in Lemma [191 The disagreement probabilities are summarised in 
the following corollary (of Lemma fT9ll. 

Corollary 23. For any two sites v,u E V let d{v,u) denote the edge distance between 
them. Suppose that a block 6^ has been selected to be updated. For any pair {x, y) G Si 
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obtain {x', y') from the above coupling. Then for any j G 9^ 

Py^^x'- 7^ y') < I ^ 9^) ° if i is on the boundary of 
I otherwise. 

The following lemma implies Theorem [6] by Theorem [9] (path coupling). 
Lemma 24. Suppose that {x,y) G Si and obtain {x',y') by one step o/TMrnd- Then 

E[Ham(x',y')] < 1 - ' 7- 

n + 2sq'^ + s — 1 

Proof. There are 57 blocks containing site i and if such a block is selected then Ham(x', y') - 
0. There are at most 2 blocks adjacent to site i and if such a block is selected then the 
discrepancy percolates in the block according to the probabilities stated in Corollary [23l 
This leaves n + S'y — l — sj — 2 = n — 3 blocks that leave the Hamming distance unchanged. 
Hence, using Lemma EH we have 



'ys |_ ^ J ' 



n — 3 



n — 1 2sq 
< 7 + 



n + S7 — 1 



n + s-y — 1 n + 57 — 1 
2sq^ + n — 1 ^ 



2sq^ + n — \ + s 2sq^ + n — 1 + s 

by substituting the definition of 7. □ 
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